Structured Near-Optimal Channel- Adapted Quantum Error Correction 
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We present a class of numerical algorithms which adapt a quantum error correction scheme to a 
channel model. Given an encoding and a channel model, it was previously shown that the quantum 
operation that maximizes the average entanglement fidelity may be calculated by a semidefinite 
program (SDP), which is a convex optimization. While optimal, this recovery operation is com- 
putationally difficult for long codes. Furthermore, the optimal recovery operation has no structure 
beyond the completely positive trace preserving (CPTP) constraint. We derive methods to generate 
structured channel-adapted error recovery operations. Specifically, each recovery operation begins 
with a projective error syndrome measurement. The algorithms to compute the structured recovery 
operations are more scalable than the SDP and yield recovery operations with an intuitive physical 
form. Using Lagrange duality, we derive performance bounds to certify near-optimality. 



PACS numbers: 



I. INTRODUCTION 



All physical implementations of quantum information 
processing systems must incorporate a scheme to miti- 
gate the effects of noise. The most common method for 
quantum error correction (QEC) is analogous to classical 
digital error correction schemes. The system of interest 
is encoded into a subspace of a larger quantum system by 
means of a quantum code. After passing through a noisy 
channel, a syndrome measurement projects errors onto 
orthogonal subspaces from which the original quantum 
state can be recovered. The first quantum error correct- 
ing codes demonstrated that such methods could cor- 
rect arbitrary single qubit errors 0, 0, ||. These generic 
methods enabled a whole range of study in quantum er- 
ror correction, particularly as it applies to fault-tolerant 
quantum computing. 

The generic approach has its drawbacks, however. 
Most notably, quantum codes impose a severe amount 
of overhead to correct for arbitrary errors. As an exam- 
ple, the shortest block code that corrects an arbitrary 
qubit error embeds one qubit into five0, Q. As scaling 
to many qubits is one of the principal barriers to building 
a working quantum computer, any efforts to improve the 
efficiency of error recovery are of great interest. 

Several recent efforts have explored an optimization- 
based approach to quantum error recovery^, 0, H, Q. 
In each case, rather than correcting for arbitrary single 
qubit errors, the error recovery scheme was adapted to 
a model for the noise, with the goal to maximize the fi- 
delity of the operation. In [a], a semidefinite program 
(SDP) was used to maximize the entanglement fidelity, 
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given a fixed encoding and channel model. In Q and 
encodings and decodings were iteratively improved us- 
ing the performance criteria of ensemble average fidelity 
and entanglement fidelity, respectively. A sub-optimal 
method for minimum fidelity, using an SDP, was pro- 
posed in Q . An analytical approach to channel-adapted 
recovery based on the pretty-good measurement and the 
average entanglement fidelity was derived in [io| . The 
main point of each scheme was to improve error correc- 
tive procedures by adapting to the physical noise process. 

As in [Qj, we choose to focus our channel-adapted 
efforts on the recovery operation. While channel- 
adaptation can be advantageous in both the encoding 
and the recovery operations, the optimization problem 
has a significantly nicer form when one of the two is held 
fixed. The numerical tools we develop can be used for 
either half the problem; focusing on quantum error re- 
covery (QER) operations illustrates nearly all of the im- 
portant numerical procedures. 

The optimization approach to quantum error recovery 
demonstrates the utility of channel-adaptivity. Such ef- 
forts have shown that quantum error correction designed 
for generic errors can be inefficient in the face of a partic- 
ular noise process. Since overhead in physical quantum 
computing devices is challenging, it is advantageous to 
maximize error recovery efficiency. 

Recovery operations generated through convex opti- 
mization methods suffer two significant drawbacks. First, 
the dimensions of the optimization problem grow expo- 
nentially with the length of the code, limiting the tech- 
nique to short codes. Secondly, the optimal operation, 
while physically legitimate, may be quite difficult to im- 
plement. The optimization routine is constrained to the 
set of completely positive, trace preserving (CPTP) op- 
erations, but is not restricted to more easily implemented 
operations. 

In this paper, we describe efforts to determine near- 
optimal channel-adapted quantum error recovery proce- 



2 



dures that overcome the drawbacks of optimal recov- 
ery. We impose an intuitively satisfying structure on 
the recovery operation and seek to optimize performance. 
While still numerical procedures, the result is a class of 
algorithms that is less computationally intensive than the 
SDP and which yields recovery operations of an intuitive 
and potentially realizable form. 



II. CHANNEL-ADAPTED RECOVERY 

To adapt quantum error recovery to a specific chan- 
nel model, we must first determine a measure of perfor- 
mance. As detailed in @, 0|, both entanglement fidelity 
(F e ) and ensemble average fidelity (F) yield convex op- 
timization problems. 1 As both measures may be of in- 
terest, we will use the average entanglement fidelity of 
ficj |. from which either of the above are a special case. 
Average entanglement fidelity is defined for a channel A 
and an ensemble E of states {pt} with prior probabilities 
Pi as 

F e (E,A) = ^ftFete,.*) - ^ItrO^I 2 , (1) 

i i,k 

where {Ak} are the Kraus elements for the CPTP map 
A. 
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FIG. 1: Quantum error correction block diagram. For 
channel- adapted recovery, the encoding isometry Uc and the 
channel £ ' are considered as a fixed operation £ and the re- 
covery 1Z is chosen according to the design criteria. 

Consider the simple block diagram for a QEC system 
given in Fig. [1] We begin with a fixed model, labeled 
to describe the physical noise process. In design- 
ing a QEC procedure, we can choose the encoding Uc 
and the recovery operation 1Z. By holding either the 
encoding or the recovery operation as fixed, optimizing 
the other can be cast as a convex optimization problem 
H, 0,3. As done in 0,3, one can iteratively optimize 
a encoding/recovery scheme. In this paper, we focus our 
attentions on the efficacy of adapting the recovery opera- 
tion and defer iterative optimization to subsequent work. 



1 Ensemble average fidelity yields a convex optimization problem 
if and only if the states in the ensemble are pure. 



We illustrate our emphasis on the recovery block by con- 
sidering both the encoding Uc and the channel £' as a 
combined operation £. 

It is useful to note the dimensions of the various oper- 
ations in Fig. [TJ We define two Hilbert spaces Hs and 
TLc, which refer to source and the code spaces, respec- 
tively. These have dimensions d$ and dc ■ The combined 
encoding and channel £ therefore maps density matri- 
ces in C(Hs) to C(Hc), where C(H) refers to the space 
of bounded linear operators on H. Our use of the fi- 
delity implies that 1Z maps from C(Hc) to C(Hs), i.e. 1Z 
performs a decoding. This is mostly for computational 
convenience as ds < dc- 

Channel-adapted recovery selects an operation 1Z to 
maximize F e (E,1Z o £). As shown in [6|, exact maxi- 
mization can be accomplished via the convex optimiza- 
tion routine of semidefinite programming (SDP). For the 
remainder of the paper, we will discuss routines to ap- 
proach the optimum channel-adapted recovery through a 
more computationally feasible method. In several cases, 
the routines also yield an intuitive form for the recovery 
operation. 

We will make use of a convenient isomorphism in which 
bounded linear operators are represented by vectors and 
denoted with the symbol |-)). While there are several 
choices for this isomorphism [III f]~3 ] . including most in- 
tuitively a "stacking" operation, we will follow the con- 
ventions of [HI (also @) which result in an isomorphism 
that is independent of the choice of basis. For conve- 
nience, we will restate the relevant results here. 

Let A = Y]jj aij \i) (j\ be a bounded linear operator 
from H to K {i.e. A e C{K,H)), where {\i)} and {\j)} 
are bases for K, and H, respectively. Let 7i* be the dual 
of TL. This is also a Hilbert space, generally understood 
as the space of bras If we relabel the elements as 
\j) = then we represent A as a vector in the space 
JC®H.* as 

\A)) = j2^\m- (2) 

ij 

It is useful to note the following facts. The inner prod- 
uct ((A\B)) is the Hilbert-Schmidt inner product trA^B. 
Also, the partial trace trjc\A))((B\ = AB^ . Finally, index 
manipulation yields the relation A <g) B\C)) — \ACB^)) 1 
where B is the conjugate of B such that B \tp) = B \ip) 
for all 

These relations lead directly to a convenient repre- 
sentation of a CPTP operation A : C(H) i— > £(/C) 
in terms of a positive semidefinite (p.s.d.) operator 
X A e C{K ® H*)[ll|, [H [3, [H, El. Th e p.s.d. oper- 
ator is calculated from the Kraus elements {Ak} of A 
as 

X A = "£\A k )){(-Akl (3) 
k 

We will refer to X A as the Choi matrix for A, although 
most derivations do not use the basis-free free double-ket 
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of ((2|). The operation output is given by A(p) — tin* I C3) 
pX_4 and the CPTP constraint requires that Xa > and 
tr K X A = I. 

In terms of the Choi matrix, the average entanglement 
fidelity can be written as F e (E,A) = J2iPi{{pi\Xy\\pi}}- 
From this expression, we can derive the dependence of 
F e (E,TZo£) on TZ as 

F e (E,TZo£) = Y,PM Xn ° M 

i 

i k 

= trX K C E , £ , (4) 

where Ce.s — J2ik Pi\P^k)) ((P^l\ encapsulates both the 
input ensemble E and the channel (with encoding) £ . 
It was shown in 0, 0] the the optimum X-ji satisfying 
the CPTP constraint can be calculated via semidefinite 
programming. 



III. EIGQER ALGORITHM 

To achieve a near-optimal QER operation, an algo- 
rithm must have a methodology to approach optimality 
while still satisfying the CPTP constraints. Furthermore, 
to ease implementation of such a recovery, we can impose 
structure to maintain relative simplicity. 

Let us begin by considering the structure of a stan- 
dard QEC recovery operation. QEC begins by defining 
a set of correctable errors, i.e. errors that satisfy the 
quantum error correction conditions. To correct this set, 
we construct the recovery operation by defining a pro- 
jective syndrome measurement. Based on the detected 
syndrome, the appropriate unitary rotation restores the 
information to the code space, thereby correcting the er- 
ror. This intuitive structure - projective measurement 
followed by unitary syndrome recovery - provides a sim- 
ple geometric picture of error correction. Furthermore, 
it is a relatively straightforward task to translate such a 
recovery operation into a quantum circuit representation. 

Let us impose the same constraint on the channel- 
adapted recovery operation. We construct an operation 
with operator elements that are a projective syndrome 
measurement followed by a classically controlled unitary 
operation. Thus the operator elements can be written 
{Rk = UkPk] where Pk is a projection operator. While 
we could merely constrain Uk to be unitary, we will in- 
stead continue with the convention that the recovery op- 
eration performs a decoding: TZ : £{Ttc) | — * C(Hs). Un- 
der this convention, Uk £ £(Wc,Ws) and Uj,Uk = I. In 
words, both Uf. and R k are isometries. 



The CPTP constraint 

i = Y. R i R * ( 5 ) 

k 

= J2 PkU t UkPk ( 6 ) 

k 

= Y. Pk ( 7 ) 

k 

is satisfied if and only if the projectors span Tic- To 
satisfy the CPTP constraint, therefore, {Pk} must parti- 
tion Tic into orthogonal subspaces, each identified with 
a correction isometry 2 Uk- 

Since the {Pk} project onto orthogonal subspaces, we 
see that RjRk = SjkPk- From this we conclude that 
are an orthogonal set and thus are eigenvectors 
of the Choi matrix X-j^. The eigenvalue Aft associated 
with \Rkf) is the rank of Pk and is thus constrained to 
be an integer. Furthermore, since Uk restores the k th 
syndrome to TLs, Xk < ds- 

We can conceive of a 'greedy' algorithm to construct 
a recovery operation TZ. The average entanglement fi- 
delity can be decomposed into the contributions of each 
individual operator element as ((Rk\C 'e ,s\Rk)) ■ We can 
construct TZ by successively choosing the syndrome sub- 
space to maximize the fidelity contribution. As long as 
each syndrome is orthogonal to the previously selected 
subspaces, the resulting operation will be CPTP and will 
satisfy our additional constraints. In fact, this greediest 
algorithm has no immediate method for computation; the 
selection of the syndrome subspace to maximize the fi- 
delity contribution has no simple form. We propose in- 
stead a greedy algorithm to approximate this procedure. 

We motivate our proposed algorithm in terms of eigen 
analysis. Let us assume for the moment that the rank of 
each syndrome subspace is exactly ds which is the case 
for QEC recoveries for stabilizer codes. By such an as- 
sumption, we know that there will be dc/ds recovery 
operator elements. Consider now the average entangle- 
ment fidelity, in terms of the eigenvectors of X-^: 

dc/ds 

F{E,Tlo£)= ((Rk\C E . £ \Rk))- (8) 
fc=i 

If we were to maximize the above expression with the 
only constraint being a fixed number of orthonormal vec- 
tors \Rk}}, the solution would be the eigenvectors associ- 
ated with the dc/ds largest eigenvalues of Ce,s- In fact, 
the actual constraint differs slightly from this simplifica- 
tion, as we further must constrain R k to be an isometry 
(i.e. RkR\ = I)- The analogy to eigen-analysis, how- 
ever, suggests a computational algorithm which we dub 



2 In fact, U^. is the isometry. For ease of explication, we will refer 
to Uk as an isometry as well. 
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'EigQER' (for eigen quantum error recovery). We use the 
eigenvectors of Ce.e to determine a syndrome subspace 
with a large fidelity contribution. 
The algorithm proceeds as follows: 

1. Initialize C\ = Ce,s- 
For the k th iteration: 

2. Determine \Xk)), the eigenvector associated with 
the largest eigenvalue of Ck- 

3. Calculate the isometry 'closest' to X\. via the 
singular value decomposition. Call R k an operator 
element of 1Z. 

4. Determine Ck+i by projecting out of Ck the sup- 
port of R k . 

5. Return to step 2 until the recovery operation is 
complete. 

The EigQER algorithm is guaranteed to generate a 
CPTP recovery operation, and will satisfy the criterion 
that it can be implemented by a projective syndrome 
measurement followed by a syndrome dependent unitary 
operation. 

Steps 2 and 3 in the above algorithm require further 
exposition. Given an operator X £ C(7ic,Hs), what 
is the closest isometry Rkl A straightforward answer 
uses the norm derived from the Hilbert-Schmidt inner 
product where ||A|| 2 = trj4JA We will now allow the 
rank of k th subspace to be dk < ds- 3 Thus RkRu = Id k 
where Id k is a diagonal operator with the 1 as the first 
dk diagonal matrix elements and for the rest. We have 
the minimization problem 



mintr(A - R k )HX - R k ) such that R k R\ = I dk 



We will state the solution as the following lemma. 



(9) 



Lemma 1. Let X be an operator with singular value 
decomposition X — ITSV'. The rank d isometry R that 
minimizes the Hilbert-Schmidt norm difference \\X — R\\ 
is given by R= UI c i I V' i . 

Proof. Let Ud be the set of rank d isometries; that is 
U d = {U\WU = I d }. We wish to find the R* £ U that 
minimizes tr(A — Ry(X — R). Since this can be written 
as 

tr(A - R) ] {X -R) = txX^X + tvR^R - tr(X^R + R^ X) 

(10) 



3 Inclusion of reduced rank subspaces may seem unnecessary or 
even undesirable - after all, such a projection would collapse su- 
perpositions within the encoded information. We allow the pos- 
sibility since such operator elements are observed in the optimal 
recovery operations of [f|. 



and tri?^ R = d, an equivalent problem is 
max tr(X + R + ltfX) = maxtr(FEL rt J R + i? t C/SV rt Xll) 

where we have replaced X with its singular value decom- 
position. 

We can simplify the above expression by noting that 
C< = WR G U. We can thus equivalently maximize the 
following expression over C eW: 

tr^EC 1 + CE^) = tiE(tfV + V^C) (12) 

d 

= ^2<ri(4 v i + v l c i) ( i3 ) 

i=l 
d 

- 2£> i Re{«M} (14) 

i=i 

d 

< 2j2°i\vh\ (15) 

i=l 
d 



< 2^V i |H|||c i || (16) 

i=l 
d 

(17) 

i=l 



= 2£"< 



In (fT3")) . o~i is the i largest singular value of X and Vi and 
Ci are the i th columns of V and C, respectively. We have 
used the fact that E is a diagonal matrix of the singular 
values in descending order. The inequality is saturated 
when Ci = Vi, which also implies that C = VId =>• R = 
UI d Vl □ 

One item not mentioned above is the determination of 
the desired rank d k ■ In our implementation of EigQER, 
this is accomplished by setting a relatively high threshold 
on the singular values of X. We only considered singular 
values such that a 2 > .05. This ad hoc value was chosen 
as it led to acceptable numerical results in the examples. 

We turn now to step 3 of the EigQER algorithm. Re- 
call that the CPTP constraint as written in J7]) requires 
that the syndrome subspaces are mutually orthogonal. 
Thus, the syndrome measurement for the k th iteration 
must be orthogonal to the first fc — 1 iterations: PkPi = 
for i < k. We satisfy this constraint by updating the data 
matrix Ck-i- 

To understand the update to C k -i, recall that the first 
step of the k th iteration is the computation of the dom- 
inant eigenvector \Xk)). To satisfy the constraint, we 
require that 

X k P % = o \X k Pi)) =1® Pi\X k )) = (18) 

for i < k. All | A)) for which this is not satisfied should 
be in the nullspace of Ck- Thus, after each iteration we 
update the data matrix as 



C k = (I - I ® Pk-i)C k -i(I - I ® P k -x). 



(19) 
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FIG. 2: Fidelity contribution of EigQER recovery operators 
for the amplitude damping channel (7 = .09) and the Steane 
code. Notice that the QEC performance is equaled with only 
8 operator elements, and the relative benefit of additional 
operators goes nearly to zero after 30. 



The algorithm terminates when the recovery operation 
is complete, i.e. ^ k R\Rk = Ylk ^ ~ I- Given the 
structure of the recovery operations, this can be deter- 
mined with a simple counter that is increased by d k at 
each step k. When the counter reaches dc, the recovery 
is complete. 

In fact, the greedy nature of EigQER allows early 
termination of the above algorithm. Each R k con- 
tributes ((Rk\CE,s\Rk)) to the average entanglement fi- 
delity. Since the algorithm seeks to maximize its gain 
at each step, the performance return of each Rk dimin- 
ishes as k grows. This is illustrated in Fig. [3J where 
we show the cumulative contribution for each recovery 
operator element with the Steane code and the ampli- 
tude damping channel. The greedy construction results 
in simplifications in both computation and implementa- 
tion. When the contribution ((Rk\C e ,s\Rk)) passes be- 
low some selected threshold, the algorithm may termi- 
nate and thus reduce the computational burden. This 
results in an under-complete recovery operation where 
J2k Rk^ k — ^ n under-complete specification for the 
recovery operation may significantly reduce the difficulty 
in physically implementing the recovery operation. In 
essence, an under-complete recovery operation will have 
syndrome subspaces whose occurrence is sufficiently rare 
that the recovery operation may be left as a 'don't care.' 

Before we consider examples of EigQER recovery per- 
formance, we should say a few words about the algorithm 
complexity when channel adapting an [n, k] code. The 
SDP of [3, 0] to calculate the optimal recovery operation 
has 4™ +fe complex optimization variables constrained to a 
semidefinite cone with a further 4 fe equality constraints. 



From [17], an SDP with n variables and a px p semidef- 
inite matrix constraint requires C(max{np 3 , n 2 p 2 , n 3 }) 
flops per iteration (with typically 10-100 iterations nec- 
essary). For our case, this yields 0(2^ n+k ^) flops per 
iteration. 

For the EigQER operation, the dominant computation 
is the calculation of | A^)) , the eigenvector associated with 
the largest eigenvalue of Ck- Ck is a 2 n+k x 2 n+k dimen- 
sional matrix, but the eigenvector has only 2 n+k dimen- 
sions. Using the power method for calculating the dom- 
inant eigenvector requires 0(2 2 ^ n+k ^) flops for each iter- 
ation of the power method. While both problems grow 
exponentially with n, the reduced size of the eigenvector 
problem has a significant impact on the computational 
burden. 

We should note that the eigenvector computation must 
be repeated for each operator element of 1Z. If we were 
to compute all of them, not truncating early due to the 
diminishing returns of the greedy algorithm, this would 
require iterating the algorithm approximately dc/ds — 
2«-fc times. In fact, we have a further reduction as the 
algorithm iterates. At the j th iteration we are calculating 
the dominant eigenvector of Cj which lives on a (dc — 
jds)ds = 2 k (2 n — j2 k ) dimensional subspace. We can 
therefore reduce the size of the eigenvector problem at 
each iteration of EigQER. 



A. EigQER examples 

To demonstrate the value of the EigQER algorithm, 
we consider several channels and codes; we would like 
to consider common codes and channels with non-trivial 
channel-adapted recoveries. It will be shown in the ap- 
pendix that channels represented by scaled Pauli group 
operators yield straightforward channel-adapted recovery 
operations; it is therefore useful to consider non-Pauli 
channels. The most common and useful such channel is 
the amplitude damping channel, which we will denote £ a . 
Amplitude damping was the example used in Q to illus- 
trate optimal QER, as well as the example for channel- 
adapted code design of [Hj]. The channel is a commonly 
encountered model, where the parameter 7 indicates the 
probability of decaying from state |1) to |0) (i.e. the prob- 
ability of losing a photon). For a single qubit, £ a has 
operator elements 



E n = 



1 

VI - 7 



and Ei 



V7 




(20) 



The EigQER algorithm does not require a channel as 
simple to model as the amplitude damping channel; the 
optimization routine is general to any channel. To illus- 
trate, we consider a qubit channel that is less familiar, 
though with a straightforward geometric description. We 
will call this the 'pure states rotation' channel and label 
it as £ ps . To describe the channel, we define a pure state 
by its angle in the xz-plane: \9) = cos#|0) + sin#|l). 
The channel mapping is defined by its action on two 
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pure states an angle 9 apart, symmetric about the z- 
axis. When \±8/2) is input to the channel, the result is 
|±(6> — (j))/2), also as a pure state. Thus, these two states 
are rotated toward each other by (f>. Any other state input 
to the channel will emerge mixed. The operator elements 
for this channel can be written as 



' cos ^ sin f ± cos ^ cos § 
± sm — J 1 * 1 sm 2 sm — ^f - cos ^ 



0.9 



0.8 



1 fTTp . 



where a and /3 are constants chosen to satisfy the CPTP 
constraint. 

The pure states rotation channel has multiple param- 
eters which characterize its behavior. 9 indicates the ini- 
tial separation of the targeted states. 0, the amount of ro- 
tation, clearly parameterizes the 'noise strength' as <f> = 
indicates no decoherence while (j) = 6 is strong decoher- 
ence. Furthermore, we have chosen the target states to be 
symmetric about the z-axis, but this is only for clarity in 
stating the channel; any alternate symmetry axis may be 
defined. Furthermore, a similar channel with asymmetric 
rotations 4>i and </>2 may be defined. This, however, cor- 
responds to a symmetric channel followed by a unitary 
rotation. While less physically motivated than amplitude 
damping, the pure state rotation channel model provides 
an extended set of qubit channels which are not repre- 
sented with Pauli group operator elements. We will look 
at examples of this channel where 9 = 5tt/12. There is no 
particular significance to this choice; it merely illustrates 
well the principles of channel- adapted QEC. 

Since the EigQER algorithm is more computationally 
scalable than the SDP, we can consider channel-adapted 
QER for several codes. We compare the EigQER recov- 
ery performance to the optimal channel-adapted recov- 
ery performance for the 5 qubit stabilizer code [J, |5(. We 
also compare the EigQER performance for the 5 qubit 
codejthe 7 qubit Steane code[l,[l|, and the 9 qubit Shor 
code[l|. All comparisons consider an ensemble E of qubit 
states that are in the completely mixed state p = 1/2. 

Figure [3] compares the performance of the EigQER 
algorithm to the optimal QER recovery for the case of 
the five qubit stabilizer code and the amplitude damping 
channel. Also included are the generic QEC recovery and 
the entanglement fidelity of a single qubit acted upon by 
£ a (i.e. no error correction performed). From this ex- 
ample we observe that the EigQER performance nearly 
achieves the optimum, especially for the values of 7 be- 
low .4. For higher 7, the EigQER performance begins to 
diverge, but this is less important as that region is one 
in which even the optimal QER lies below the fidelity of 
a single qubit obtainable with no error correction. 

Figure 0] compares EigQER and optimal QER for the 
five qubit stabilizer code and the pure state rotation 
channel with 9 — 57r/12. We see again that the EigQER 
algorithm achieves a recovery performance nearly equiv- 
alent to the optimum, especially as the noise level ap- 
proaches 0. 
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FIG. 3: EigQER and Optimal QER for the amplitude damp- 
ing channel and the five qubit stabilizer code. EigQER nearly 
duplicates the optimal channel-adapted performance, espe- 
cially for lower noise channels (small 7). 




FIG. 4: EigQER and Optimal QER for the pure state ro- 
tation channel with 6 = 57r/12 and the five qubit stabilizer 
code. EigQER nearly duplicates the optimal channel-adapted 
performance, especially for lower noise channels (small (f>). 



Figure[5]demonstrates the performance of several codes 
and the amplitude damping channel. We compare the 
EigQER performance for the five, seven, and nine qubit 
codes, contrasting each with the generic QEC perfor- 
mance. Notice first the pattern with the standard QEC 
recovery: the entanglement fidelity decreases with the 
length of the code. The five qubit stabilizer code, the 
Steane code, and the Shor code are all designed to cor- 
rect a single error on an arbitrary qubit, and fail only if 
multiple qubits are corrupted. For a fixed 7, the prob- 
ability of a multiple qubit error rises as the number of 
physical qubits n increases. 
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FIG. 5: EigQER and standard QEC recovery performance for 
the five, seven, and nine qubit codes and the amplitude damp- 
ing channel. Note that generic QEC performance decreases 
for longer codes, as multiple qubit errors become more likely. 
While the EigQER performance for the nine qubit Shor code 
is excellent, the seven qubit Steane code shows only mod- 
est improvement, with performance similar to the generic five 
qubit QEC recovery. 



The QEC performance degradation with code length is 
a further illustration of the value of channel adaptivity. 
All three codes in Figure [5] contain one qubit of infor- 
mation, so longer codes include more redundant qubits. 
Intuitively, this should better protect the source from er- 
ror. When we channel adapt, this intuition is confirmed 
for the Shor code, but not for the Steane code. In fact, 
the EigQER entanglement fidelity for the Steane code is 
only slightly higher than the generic QEC recovery for 
the five qubit code. From this example, it appears that 
the Steane code is not particularly well suited for adapt- 
ing to amplitude damping errors. We see that the choice 
of encoding significantly impacts channel-adapted recov- 
ery. 

The effect is even more dramatically (and puzzlingly) 
illustrated in the pure state rotation channel. Figure 
[6] compares the EigQER recoveries for the five qubit, 
Steane, and Shor codes with 8 — 5tt/12. It is interesting 
to see that the five qubit code outperforms each of the 
others despite less redundancy to protect the informa- 
tion. Furthermore, both the standard QEC and channel- 
adapted recoveries for the Steane code perform worse 
than the generic recovery of the Shor code! This sug- 
gests that the five qubit code is particularly well suited 
to adapt to errors of this type, while the Steane code is 
particularly ill-suited. (We suspect that the Shor code 
with QEC recovery outperforms the Steane due to its 
degenerate structure.) 




FIG. 6: EigQER and standard QEC recovery performance for 
the five, seven, and nine qubit codes and the pure state rota- 
tion channel with 6 — 5tt/12. Despite the least redundancy, 
the five qubit code has the best channel-adapted performance. 
The Steane code appears particularly poor for this channel: 
both the generic QEC and the adapted recovery have lower 
fidelity than the other codes. 

IV. BLOCK SDP QER 

The recovery operation generated by the EigQER algo- 
rithm of the preceding section is one of a broader class of 
quantum error recoveries. The class is characterized by 
an initial projective syndrome measurement, followed by 
a syndrome-specific recovery operation. The projective 
measurement partitions "He and provides some knowl- 
edge about the observed noise process. 

Projective syndrome measurements for quantum error 
correction are tricky to design. We wish to learn as much 
as possible about the error while learning as little as pos- 
sible about the input state, so as not to destroy quan- 
tum superposition. The EigQER algorithm aggressively 
designs the syndrome measurement, as the Rk = UkPk 
structure of the operator elements implies a finality about 
the syndrome selection. The outcome of the syndrome 
measurement completely determines the correction term 

We can conceive of a less aggressive projective mea- 
surement. If we projected onto larger subspaces of Tic, 
we would learn less about the noise but perhaps have less 
chance of destroying the superposition of the input state. 
We could consider this an intermediate syndrome mea- 
surement, a preliminary step to further error correction. 
To design a recovery operation of this type, we must have 
a strategy to select a projective measurement. Given the 
outcome Pk , we must further design the syndrome recov- 
ery operation IZk- 

Consider the projective syndrome measurement oper- 
ator Pfe. For the EigQER algorithm, Pk — R k Rk always 
projects onto a subspace of dimension less than or equal 



8 



to the source space: rank(Pfc) < ds- This is an aggressive 
condition that arises from constraining the subsequent 
syndrome recovery to be a unitary operator. We will 
relax this constraint and allow an arbitrary syndrome re- 
covery IZk for the k th syndrome measurement. It turns 
out that we can determine the optimum such recovery 
7?.^ pt via semidefinite programming, just as in Q. The 
intermediate syndrome measurement Pk reduces the di- 
mension of the SDP, and thus the technique is still appli- 
cable to long codes where computing the global optimum 
recovery is impractical. 

We will demonstrate how the optimum syndrome re- 
covery TZk can be calculated via a semidefinite pro- 
gram. Let {Pk}k=i be a set of projectors such that 
Sfc Pk = I S Tic that constitute an error syndrome 
measurement. Let Sk be the support of Pk with dimen- 
sion dk] it is clear that S\ © 1S2 © • • • © Sk — He- Given 
the occurrence of syndrome k, we must now design a re- 
covery operation IZk '■ Sk 1— » Ti-s- TZk is subject to the 
standard CPTP constraint on quantum operations, but 
only has support on Sk- We may calculate the recovery 
7Zk that maximizes the average entanglement fidelity us- 
ing the SDP in a structure identical to that of [|| while 
accounting for the reduced input space: 



1 f-^- 



X Uk = argmaxtrX(C E , £ ) fe , 
such that X > 0, tr Hs X = I e S k - 



(22) 
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Here, (CE,s)k = I ® PrCe^I ® Pk is the data matrix 
projected into the k th subspace. Notice that X Kk and 
{CE,s)k are operators on TLs ® S%. In contrast to Ce,s, 
which requires dgd^ matrix elements, (CE.s)k is fully 
specified by d 2 s d\ matrix elements. By partitioning He 
into subspaces {Sk} through a careful choice of a syn- 
drome measurement {Pk}, we may apply semidefinite 
programming to high dimensional channels without in- 
curring the full computational burden of computing the 
optimal recovery. In the following sections we discuss two 
strategies for determining the syndrome measurement. 



FIG. 7: BlockEigQER performance for the five qubit code 
and the pure state rotation channel with 9 = 5tt/12. Block- 
EigQER is computed with fixed block lengths of 2, 4, and 8. 
In (A) we compare the entanglement fidelity to the EigQER 
recovery, standard QEC recovery and Single Qubit baseline. 
The different block lengths have nearly indistinguishable per- 
formance from EigQER. In (B), we compute the fidelity rel- 
ative to the EigQER recovery and show that the fidelity im- 
proves by less than 4% for the displayed region. We can 
note, however, that longer block lengths tend to better per- 
formance. 



A. Block EigQER 

The first step of an iteration of EigQER computes the 
dominant eigenvalue and corresponding eigenvector of 
Ce,s- This eigenvector corresponds to the operator that 
maximizes the average entanglement fidelity gain at a sin- 
gle step. While such an operator may violate the CPTP 
constraint for the recovery operation, it serves to iden- 
tify an important subspace onto which we may project. 
Indeed, the good performance of the EigQER algorithm 
rests on the successful identification of suitable syndrome 
subspaces via eigen-analysis. 

An intuitive extension of this concept is to use multiple 
eigenvectors to specify a higher-dimension subspace. If 
{\X m ))}m = i are the eigenvectors corresponding to the 
M largest eigenvalues of Ce.s, then it is reasonable to 
define the subspace S± as the union of the support of the 



operators {X m }. We define the corresponding projector- 
Pi and calculate the syndrome recovery TZi via the SDP 
of (|22p. As in the EigQER algorithm, we update the data 
matrix C by projecting out the subspace <Si, at which 
point we select another set of eigenvectors. We will refer 
to this algorithm as BlockEigQER. 

How many eigenvectors should be selected to define 
a block? A simple solution is for a fixed block size, 
say M, to be processed until the recovery is complete. 
For M = 1, BlockEigQER is identical to EigQER. For 
M = d$dc : BlockEigQER computes the optimal recov- 
ery operation, as the syndrome measurement is simply 
the identity operator. For values in between, one would 
expect to trade off performance for computational bur- 
den. While there is no guarantee that performance will 
improve monotonically, we would anticipate improved 
performance as M increases. 

We illustrate the performance for several choices of 
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FIG. 8: BlockEigQER for the amplitude damping channel 
and a random [6,2] code. We compare the BlockEigQER al- 
gorithm for block sizes of 2,4, and 8 with EigQER algorithm. 
We see significant performance improvement for larger block 
sizes, at the cost of computational and recovery complexity. 
Baseline in this case is the entanglement fidelity for two qubits 
input to the channel without error correction. 



M in Fig. [7] We use the pure state rotation channel 
(6 = 5tt/12) and the five qubit code with block sizes of 2, 
4, and 8. The expected improvement as M increases is 
evident, though the gain is quite modest for noise levels 
of interest (below the cross-over with the single qubit re- 
covery) and is not strictly monotonic. The variations in 
performance , including the non-monotonicity, are likely 
the result of syndrome measurements that collapse the in- 
put superpositions. While the eigenvectors of Ce,s that 
identify the syndrome subspace generally avoid collaps- 
ing the input state, the mechanism is imperfect. 

While BlockEigQER outperforms EigQER in the [5, 1] 
code, we see in (B) of Fig. [7] that the improvement is less 
than 5% within the <f> of interest. We see more signifi- 
cant gains when we encode multiple qubits. Consider a 
random [6, 2] encoding for the amplitude damping chan- 
nel, shown in Figure [H In this case we see a distinct 
performance gain as M increases and the difference is 
non-trivial. 

Fixing the block size M ignores some of the inherent 
symmetries in the channel and encoding. In particular, it 
is quite common for Ce.s to have degenerate eigenvalues. 
By fixing the number of eigenvectors to simultaneously 
consider, one may inadvertently partition such a degen- 
erate subspace according to the numerical precision of 
the eigen- analysis software. To avoid this unwanted cir- 
cumstance, we may select a variable block size based on 
the magnitude of the eigenvalues. This approach necessi- 
tates a strategy for parsing the eigenvalues into variable 
size blocks which can be a tricky procedure. Due to the 
modest returns of such an attempt, we have not pursued 



such a strategy. 

While BlockEigQER shows modest performance im- 
provements when compared to EigQER, it has one sig- 
nificant drawback. Unlike EigQER, the recovery opera- 
tion from BlockEigQER is not constrained to a collection 
of isometrics. Once the initial projective syndrome mea- 
surement is performed, the subsequent correction terms 
are arbitrary CPTP maps. This may complicate at- 
tempts to physically implement such an operation. Fur- 
thermore, BlockEigQER does not provide much more in- 
tuition for recovery design than EigQER. For this rea- 
son, we consider BlockEigQER a numerical tool whose 
principal value is its incremental improvement approach- 
ing optimality. It also prove useful for the performance 
bounds derived in Section fVl 



B. OrderQER 

We now consider a block QER algorithm that provides 
intuition for error recovery design. We are often inter- 
ested in channels where each qubit is independently cor- 
rupted; thus the overall channel is the tensor product of 
single qubit channels. We can use this structure to design 
an intuitive projective measurement. We illustrate using 
the classical bit flip channel with probability of error p. 
If a single bit of the codeword is flipped, we label this 
a 'first order error' as the probability of such an error is 
0(p). If two codeword bits are flipped, this is a 'second 
order error', which occurs with probability 0(p 2 ). 

This intuition can easily yield a choice of syndrome 
subspaces {Sk}- Consider, for example, the amplitude 
damping channel given in (f20|) . Recognizing E\ as the 
'error event,' we declare first order errors to be of the 
form E\ = Eq • • • E\ ® Eq <g) • • • where the error is on 
the fc th qubit. In this case we can declare the first order 
syndrome subspace to be 

S, = span({|<"0L) , K n l L ) , \El0 L ) , \E\l L ) \E l n l L )}), 

(23) 

where |0l) and are the logical codewords for an n- 
length code. We include the 'no error' term as numerical 
experience suggests that the code projector Pc is not 
always an optimal syndrome measurement. By parallel 
construction, we can define the second order syndrome 
subspace 6>2- While these two will probably not com- 
plete the space TCc, quite possibly we may neglect any 
higher orders. Alternatively we can analyze the remain- 
ing subspace with either the SDP or the numerically sim- 
pler EigQER algorithm. We will refer to this block SDP 
algorithm as OrderQER. 

The SDP's for first and second order subspaces signif- 
icantly reduce the dimension from the full optimal SDP, 
though the effect is not as dramatic as BlockEigQER. 
Consider the case of the amplitude damping channel 
which has only two operator elements for the single qubit 
channel. For an [n, k] code, there is one 'no error' opera- 
tor and n first order error operators. This suggests that 
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FIG. 9: OrderQER recovery for the seven qubit Steane code 
and the amplitude damping channel. We compare the recov- 
ery fidelity of the I s * order error to the standard QEC perfor- 
mance. The performance of the I s * and 2 nd order recoveries 
together are comparable to the EigQER recovery, especially 
as 7 approaches 0. 



Si has dimension (n + l)ds = (n + l)2 fc . The SDP then 
has (n + l) 2 2 4fc optimization variables. Contrast this n 2 
growth with the 4™ growth of the optimal SDP. For sec- 

?Y error operators. 



ond order errors, there are 



The subspace S2 has approximate dimensions of n 2 2 k ~ 1 
and thus the SDP has n 4 2 4fe_2 optimization variables. 
For the [7, 1] Steane code, computing the full optimal 
SDP requires an impractical 4 7 • 4 = 65536 variables. 
However, the first order SDP requires 8 2 2 4 = 1024 vari- 
ables and the actual second order SDP has 42 2 • 4 = 7056 
optimization variables. For contrast, the full SDP and 
the five qubit code requires 1024 optimization variables. 
For the [9, 1] Shor code, the second order SDP has an 
impractical 72 2 • 4 = 20736 optimization variables. We 
therefore do not use OrderQER for the Shor code. 

While the scaling of OrderQER grows quickly with 
n making its use challenging for codes as long as nine 
qubits, OrderQER results provide significant insight into 
the mechanism of channel-adaptation. Consider the 1 st 
and 2 nd order recovery performance for the Steane code 
and the amplitude damping channel from Figure [5] We 
note that the fidelity performance for the recovery from 
S\ is comparable to the performance of standard QEC, 
especially as 7 approaches 0. This matches the intuition 
that standard QEC is correcting single qubit errors which 
are almost completely restricted to £>i- For small 7, the 
most likely syndrome measurement will be a Pauli X or 
y, as these characterize single qubit dampings. These 
same errors are corrected by I s * order OrderQER. As 7 
grows, the distortion from the 'no error' term Eg® - ■ -(3Eq 
becomes more pronounced and the QEC outperforms I s * 



order OrderQER. 

We see that I s * and 2 nd order recovery performance 
is quite comparable to the EigQER performance. Thus, 
the performance gains observed for channel adapted QER 
can be understood as corrections of higher order errors. 
Since Si has dimension significantly less than dc and yet 
approximates the QEC recovery performance, it is only 
reasonable that the remaining redundancy of the code 
can be exploited to protect from further error. 



V. QER PERFORMANCE UPPER BOUND 

In the preceding sections, we imposed constraints on 
the recovery operations to provide structure and aid com- 
putation. While the resulting channel-adapted recoveries 
out perform the generic QEC recovery operation in all of 
the examples, the constraints essentially guarantee sub- 
optimality. For the five qubit code (where computation 
of the optimal QER operation is practical), we observe 
that the proposed algorithms (EigQER, BlockEigQER, 
and OrderQER) closely approximate the optimal perfor- 
mance. This anecdotal evidence, however, is hardly suf- 
ficient to justify the bold description in the title of 'near- 
optimal' channel-adapted QER. In this section, we more 
fully justify the near-optimal label by deriving channel- 
adapted performance bounds. We accomplish this by 
using the Lagrange dual function. 

Every optimization problem has an associated dual 
problem 17(. Derived from the objective function and 
constraints of the original optimization problem (known 
as the primal problem), the dual problem optimizes over 
a set of dual variables often subject to a set of dual con- 
straints. The dual problem has several useful properties. 
First of all, the dual problem is always convex. In many 
cases, calculation of the dual function is a useful method 
for constructing optimization algorithms. Most impor- 
tant for our purposes, the dual function provides a bound 
for the value of the primal function. We define a dual fea- 
sible point as any set of dual variables satisfying the dual 
constraint. The dual function value for any dual feasible 
point is less than or equal to the primal function at any 
primal feasible point. (We have implicitly assumed the 
primal function to be a minimization problem, which is 
the canonical form.) 

The dual function for channel-adapted recovery was 
derived in [7j ; we will re-derive it here in a notation more 
convenient for our purposes. 

The primal problem as given in [|| can be stated suc- 
cinctly as 



min — trXCE,s, such that X > and tr^gX = /• 

(24) 

The negative sign on the trX Ce,s terms casts the primal 
problem as a minimization, which is the canonical form. 
The Lagrangian is given by 

L(X,Y,Z) = -trXC E ,s+trY(ti Hs X-I)-tiZX, (25) 
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where Y and Z > are operators that serve as the la- 
grange multipliers for the equality and generalized in- 
equality constraints, respectively. The dual function is 
the (unconstrained) infimum over X of the Lagrangian: 



g(Y,Z) = ML(X,Y,Z) 



(26) 



= inf -trX(C E ,s + Z-I®Y)-tiY, (27) 

where we have used the fact that tr{Ytr'n s X) = tr(7 £g) 
Y)X . Since X is unconstrained, note that g(Y, Z) = — oo 
unless Z — I <£> Y — Ce,s in which case the dual func- 
tion becomes g(Y, Z) = — trY. Y and Z > are the 
dual variables, but we see that the dual function de- 
pends only on Y. We can therefore remove Z from 
the function as long as we remember the constraint im- 
plied by Z = I (£) Y — Ce,s- Since Z is constrained to 
be positive semidefinite, this can be satisfied as long as 

I®Y-C E ,s >0. 

We now have the bounding relation — \xXCe,z > tr—Y 
for all X and Y that are primal and dual feasible points, 
respectively. If we now reverse the signs so that we have 
a more natural fidelity maximization, we write 



F e (EK o £) = trX n C E , £ < trY, 



(28) 



where TZ is CPTP and I <g> Y - C E ,e > 0. To find the 
best bounding point Y, we solve the dual optimization 
problem 



min trY, 

Y 



such that I ® Y - C E .s > 0. 



(29) 



Notice that the constraint implies that Y = Y\ Note 
also that Y G C{H* C ). 

We will use the bounding property (|2"8")1 of the dual 
function. Given any dual feasible point Y G C{TL* C ), we 
know that trY upper bounds F e (E, TZo£) for all TZ; Y is 
thus a certificate of convergence for a recovery operation. 

To provide a good performance bound, it is desirable to 
find a dual feasible point with a small dual function value. 
Indeed, the best such bound is the solution to (|2"9"|) , that 
is to find the dual feasible point with the smallest trace. 
However, finding the optimal Y is the equivalent of solv- 
ing for the optimal recovery due to the strong duality of 
the SDP. As this suffers the same computational burden 
as computing the optimal recovery, we require an alter- 
nate method for generating useful dual feasible points. 
We will establish methods to convert the sub-optimal 
recovery operations of the preceding sections into dual 
feasible points. 

We need to determine a good dual feasible point begin- 
ning with one of the sub-optimal recoveries computed by 
the EigQER, BlockEigQER, or OrderQER algorithms. 
We utilize the structure of the sub-optimal recovery op- 
erations to generate a dual feasible point. We present two 
methods that exploit the projective syndrome measure- 
ment to achieve performance bounds. The first bound is 
motivated by the proof of Theorem [3] in the appendix, 
where the optimal dual feasible point is constructed 



for Pauli group errors. Beginning with this construc- 
tion and the recovery generated by EigQER, we use the 
Gersgorin disc theorem to generate a dual feasible point. 
The resulting dual function we denote the Gersgorin 
dual bound. The second construction iteratively gener- 
ates dual feasible points given an initial infeasible point. 
While it is more computationally burdensome, it gen- 
erates tighter bounds for the considered examples. We 
begin with a trial dual variable that may or may not be 
feasible and iteratively extend this point until it is feasi- 
ble. We call this construction the iterative dual bound. 
We present several methods for providing an initial trial 
point. 

Discussion of both bounding methods is facilitated by 
choosing an appropriate basis for Tis ® Tic- Both meth- 
ods begin with a recovery operation generated by one of 
the structured sub-optimal methods. As they all begin 
with a projective measurement, the recovery provides a 
partition of Tic into subspaces S q of dimension d q de- 
scribed by projection operators {P q } G C(TLc)- We are 
interested in a basis {|vi)}f =1 where the first block of 
dsdo basis vectors span I gj Sq and the q th block spans 



I ® S* . Let us define 



{C E ,s) qq > =1® PqC E ,sI ® Pq' 

as we did in (1221) and then write 



(30) 



Ce,£ — 



(,Ce, e)oo ••• (CE,s)o q 



(CE,s)qO ■ ■ ■ {C E ,s)qq 



(31) 



in our defined basis. This block structure delineates the 
relationship of the data operator C E ,s on each of the 
subspaces S q which will be useful when discussing dual 
feasible points. 



A. Gersgorin Dual Bound 

The first method for constructing dual feasible points 
imposes a convenient structure on Y. In the case of Pauli 
group errors considered in [l9j , the optimal dual feasible 
point has the form 



Y = J2wqPq, 



(32) 



where w q are a set of weights corresponding to the prob- 
ability of the most likely error resulting in the q th syn- 
drome measurement. The form of (|32j) is appealing due 
its simplicity, especially for the EigQER recovery oper- 
ation where the rank d q of the P q is constrained to be 
< ds- While we cannot necessarily generate the optimal 
dual feasible point in this form for non-Pauli errors, we 
can use similar methods to generate a reasonable perfor- 
mance bound. 
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Before we state the Gersgorin dual bound, we take 
a second look at the optimal dual point for Pauli er- 
rors. For an [n, k] stabilizer code, recall that Tic is par- 



titioned into 2" k 
tablish a basis {\m) 
mined that \U Cq A p ) 
1 



syndrome subspaces S q and we es- 
q] for each subspace. We also deter- 
> is an eigenvector of Ce s- Note that 



{\U Cq A p ))} p=0 span the space I ® S q 



If we write out the operator {Ce,s , 
have 



<i<i 



in this basis, we 



(C E ,e) qq 



«(2 2fc -l)g_ 



(33) 



which is diagonal because are eigenvectors of 

Ce.s- This also implies that all of the off-diagonal blocks 
(CE,e)qq' where q ^ q' are also 0. We can now see that 



Y = E q a, 
point since 



qPq 



where a 



max„ \a v 



is a dual feasible 



aol 
a x I 





«2" 



(34) 



is diagonal in the chosen basis. 

We return now to the general case. Unlike in the case 
of a Pauli error channel and a stabilizer code, we can- 
not guarantee that Ce,s will be either diagonal or block 
diagonal in this basis. However, if our sub-optimal recov- 
ery 1Z is generated from the EigQER algorithm, then the 
subspaces S q are selected based on the eigenvectors of 
Ce.s and we can expect Ce,s to be approximately block 
diagonal when we partition according to the subspaces 
I <$ S*. We say that Ce,s is approximately block diago- 
nal in this basis if \\(CE,s) qq II » II (Ce ,s)qq'\\ for <7 7^ <?'• 

To generate a dual feasible point of the form Y = 
J2 q w qPqi we need to choose w q so that I®Y — Ce,s > 0. 
If Ce,c were exactly block diagonal in this basis, we could 
accomplish this by setting w q = X m&K ((CE,s)qq)- Since 
the block terms off the diagonal are not strictly 0, we 
must account for their contributions in the location of 
the eigenvalues of Ce,s- 

We will make use of a linear algebra theorem known 
as the Gersgorin disc theorem. This theorem provides 
bounds on the location in the complex plane of the eigen- 
values of an arbitrary matrix. As will be evident, the 
theorem is most valuable when the matrix is dominated 
by its diagonal entries. We state the theorem as it is 
given in [20j § 6.1: 

Theorem 2. Let A = [ay] € C nxn , and let 



%(A)= M 1<*<™ (35) 

3=1,3^* 

denote the deleted absolute row sums of A. Then all the 



eigenvalues of A are located in the union of n discs 

n 

\J{z G C : \z - an\ < R'M)} = G(A). (36) 

i=l 

Furthermore, if a union of k of these n discs forms a 
connected region that is disjoint from all the remaining 
n — k discs, then there are "precisely k eigenvalues of A 
in this region. 

Theorem [5] is particularly useful for proving the posi- 
tivity of a matrix. The R[ (A) are the radii of discs cen- 
tered at the diagonal entries an and the eigenvalues are 
constrained to lie within the union of these discs. If A is 
a Hermitian matrix, then we can be certain it is positive 
semidefinite if an > R[ (A) for all i as all of the eigenval- 
ues would be constrained to lie to the right of the origin 
(or on the origin) on the real line. 

We can apply Theorem [5] to generating a dual feasible 
point structured as (|32p . In this case we use the weights 
w q to ensure that the diagonal entries of I <£> Y — Ce,s 
are greater than the deleted absolute row sums. Let 
denote the matrix elements of Ce.s in our defined basis 
and let the basis vector \vi) lie in the subspace S q . We 
then the have the i th diagonal element [I®Y — Ce,£\u = 
w q — Cii and the i th deleted absolute row sum is Ylijtj \ c ij I ■ 
We can assure non-negativity if 



w„ > 



E 



for all i such that 



€S q . 



(37) 



Thus, we can guarantee a dual feasible point if w q is set 
to be the maximum absolute row sum for all rows i such 
that \vi) £ S q . We may express w q concisely in terms of 

5.6.5), denoted || • ||oo: 



the induced oo-norm([20( 

W q = \\[(C E ,e)gO 



{C E ,E)qq 



PqC E , £ \ 



(38) 
(39) 



The Gersgorin disc theorem is a computationally sim- 
ple way to guarantee construction of a dual feasible point 
given a partition of He into subspaces {S q }. Unfor- 
tunately, the induced infinity norm does not provide a 
particularly useful performance bound as can be seen in 
Figure 1101 When we compare to the optimal recovery 
performance for the five qubit code and the amplitude 
damping channel, we see that the dual bound is far from 
tight. In fact, for many values of 7, the bound is greater 
than 1, which is truly useless for upper bounding fideli- 
ties. While we have generated a dual point Y that is 
guaranteed to be feasible, such a guarantee imposes too 
strict a cost to have a useful bounding property. 

The Gersgorin dual bound provides useful insight for 
a tighter dual construction. If we replace the induced 
infinity norm with the induced 2-norm, we generate a 
dual point that is often dual feasible. That is, choose 



= ||/®P,C!B,fi||a 

= max ((x 1 7 <g> PqC E .£\x)', 

= o- max {I ® P q C E ,e), 



(40) 
(41) 

(42) 
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FIG. 10: Gersgorin and SVD dual bound for the ampli- 
tude damping channel and the 5 qubit stabilizer code. The 
Gersogrin bound is clearly not very useful as in some cases 
it is greater than 1. The SVD dual bound clearly tracks the 
optimal performance, although the departure from optimal of 
the bound exceeds the EigQER recovery. 

where a max (-) in (I42[) indicates the maximum singular 
value and is the computational method for the induced 
2-norm. We will refer to this construction as the SVD 
(for singular value decomposition) dual point. The Y 
generated in this way is not guaranteed to be dual feasible 
as was the case with the oo-norm, but has proven to be 
dual feasible in all of the examples that we have tried. If 
for some circumstance the SVD dual point is not feasible, 
it can be iteratively adjusted to become dual feasible in 
a manner we present in the following section. 



B. Iterative Dual Bound 

We now present an iterative procedure to generate a 
dual feasible point given an initial dual point that is 
presumably not dual feasible. After presenting the algo- 
rithm, we will discuss choices for the initial dual point. 

At the k th iteration, we update the dual point to pro- 
duce Y^ k ' until we achieve feasibility. For convenience 
we will define 

zW = I®YM-C E ,e- (43) 

Let x and \x)) be the smallest eigenvalue and associated 
eigenvector of Z^ k \ If x > 0, we may stop, as Y^ 
is already dual feasible. If x < 0, we wish to update 
a small amount to ensure that ((x\Z<- k +V\x)) > 0. 
Essentially, we are replacing a negative eigenvalue with 
a eigenvalue. Given no constraints on the update, we 
could accomplish this as Z^ k+1 ^ = Z^ + x\x})((x\ but 
we must instead update with the tensor product 

structure implicit. 



We determine the properly constrained update by 
means of the Schmidt decomposition of the eigenvector: 

14 =Y^*i\Xi)Hs\Xi)H' c - (44) 

i 

As we can only perturb Z^ in the TCq slot, we 
choose the smallest perturbation guaranteed to achieve 
((x\Z( k+r >\x)) > 0. Let 

Yik+D =Y W + P-\x l ){x 1 \. (45) 

Then 

((x\Z^\x)) = a! + ^L«i|(/®|i 1 )(£ 1 |)| a: ))(46) 

= *+^T|Ai| 2 (47) 

= 0, (48) 

since x < 0. While we have not yet guaranteed that 
> 0, | a;)) is no longer associated with a negative 
eigenvalue. By repeatedly perturbing Y^ in this man- 
ner, we iteratively approach a dual feasible point while 

adding as little as possible to the dual function value 
tr y (fe) _ 

As a final point, we demonstrate that the iterative pro- 
cedure will converge to a dual feasible point. Let's con- 
sider the effect of the k th iteration on the space orthog- 
onal to \x)). Let |y)) G 7is <£> 7~Lq be orthogonal to \x)). 
Then, for Z^ 1 ) we see that 

«y|^ fc+1 >|y» = «y|^ fe) |y» + Am*® I*i> <*i|)|yp9) 
But since I \xi) (xi\ > we see that 

({y\ZV*%)) > {(y\ZM\y)) (50) 

for all \y)) eH s ®H* c . We see that the update to Y^ 
moved one negative eigenvalue to while no new nega- 
tive eigenvalues can be created. Thus the procedure will 
eventually converge to a dual feasible point. 

C. Initial dual points 

Having established a procedure to generate a dual fea- 
sible point given an arbitrary intial point Y^°\ we now 
present initialization options. While we can start with 
any Hermitian operator in C(TIq) including 0, we do not 
recommend such an unstructured choice as each itera- 
tion is imperfect. Each iteration adds |a;|/|Ai| 2 to the 
dual function value. If |Ai| is not close to 1, the iteration 
is not efficient. We will use more educated initializations 
to begin closer to feasibility, thus minimizing the num- 
ber of iterations and improving the bounding properties 
of the resulting dual feasible point. 
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We have already presented one method for initializa- 
tion with the SVD dual point. In most cases we've seen, 
this point is already feasible and in fact is a relatively 
loose bound. Its advantage lies in its easy computation, 
but other choices provide better bounding properties. We 
would prefer an initial such that is non-positive 
with eigenvalues very close to 0. If this is the case, we will 
require only small perturbations (and thus a small dual 
function value) to achieve a positive semidefinite Z^ k \ 

Consider an initial Y^°> of the form given in (j32|) . We 
choose an initial Y^ in the same way that was used in 
the proof of Theorem [3] 



q — \nax((CE,£)qq)- 



(51) 



This is very simple to calculate, though it will not gen- 
erally be dual feasible. This is the logical choice when 
we begin with the EigQER recovery, as the only useful 
information we have is the projective syndrome measure- 
ment. This initialization often iterates to a better bound 
than the SVD dual point and requires no further infor- 
mation than the partition {S q } provided by any of the 
structured QER methods. It has one drawback, how- 
ever, in that almost certainly has eigenvalues much 
greater than 0. For the \vi) associated with the largest 



eigenvalue of (Ce,s) 



qqi 



Z^ \vi) = 0. However, unless 
[pE,e)qq nas only one distinct eigenvalue there will be 
vectors |x)) € S q such that ((x\Z^\x)) > 0, and perhaps 
quite large, relatively. Such vectors indicate portions of 
the Hubert space where Y^ is already greater than the 
optimal dual feasible point. While this likely cannot be 
avoided in the iterations, it seems wasteful to begin at 
such a point if not necessary. 

We have an alternative choice for Y^ arising from the 
block SDP QER algorithms of Sec.|lVl These algorithms 
already provide information useful for generating a dual 
feasible point. When solving the SDP on a subspace S q 
one can simultaneously generate the optimal dual func- 
tion value Y* £ C(S*). Given such optimal subspace 
dual points, define the block diagonal operator 



y(°) = 



'Yet 



Y* 
1 'i 



(52) 



as the initial point. We know that 7(g) Y* — (CE,s)qq > 0, 
so there will be |a;)) for which ((x\Z^\x)) > 0. However, 
since Y* is optimal within C(S*), we know that we are 
not being overly wasteful with the initialization. 



D. Iterated block dual 



Let's consider the computational burden of the iter- 
ated dual bound. At each iteration we must compute the 
smallest eigenvalue and associated eigenvector of Z^ k \ a 



2 n+k x 2 n+k Hermitian matrix. (We can accomplish this 
by looking for the largest eigenvalue of rjl — Z^ where 
rj > 1 is an arbitrary offset to ensure positivity.) This 
must be repeated at most 2 n+k times to ensure dual fea- 
sibility, though there may be significantly fewer iterations 
if the Z^> is nearly positive semidefinite already. As men- 
tioned in Sec. lIIIl this can be accomplished in 0(2 2 (" +fe ') 
flops by the power method. This is very costly if we must 
repeat the iteration many times. 

The block diagonal structure of the initial points sug- 
gests a slightly modified alternative procedure with some 
computational advantage. Consider the optimal dual 
points Yi and Yj in £{S*) and C(S*). We can use the 
same iterative procedure as before to compute a dual fea- 
sible Yij € C(S* S S*) requiring only 0(2 2k (di + d,) 2 ) 
flops per iteration with a maximum of 2 k (di + dj) itera- 
tions. We can generate a dual feasible point on the whole 
space £(7i^) by successively combining subspace blocks. 
Eventually we will have to iterate over the full space, but 
we will have done most of the work in the smaller blocks, 
and the full 2 n+k x 2 n+k eigen decomposition will require 
few iterations. 

In the examples we have processed, the iterated block 
dual procedure created nearly identical bounds (often 
within 10~ 5 of each other and never more than 10~ 4 ) 
as the original algorithm. The computational burden is 
reduced by approximately 20%. 



E. Examples 

We provide several examples to demonstrate the util- 
ity of the iterated dual bound. At the same time, we we 
illustrate the near optimality of the structured QER algo- 
rithms. In Fig. lilt we show several bounds for channel- 
adapted QER for the amplitude damping channel and 
the five qubit code. In this case, we know the optimal 
performance and can see that the iterated dual bound, 
beginning with the BlockEigQER with M = 2, is quite 
tight. This is in contrast to the SVD dual bound, which 
was also shown in Fig. [TO] We have included in Fig [11] 
the numerical channel-adapted recovery and performance 
bound from [lCj. We see that this bound is looser than 
even the SVD dual bound for this example. 

Figure [T2l shows several dual bounds for the amplitude 
damping channel and the nine qubit Shor code. While 
we cannot compute the optimum directly, we see that 
the EigQER performance curve and the iterated bound 
derived from BlockEigQER with M — 2 are essentially 
equivalent. We can conclude that EigQER operation is 
essentially optimal in this case. While not shown, itera- 
tions for BlockEigQER with M = 4 and M = 8 achieved 
essentially the same bound. Note that neither the SVD 
dual bound nor the iterated bound beginning with the 
EigQER recovery operation are tight, illustrating the im- 
portance of a good initialization for the dual iterations. 

Our final example is the pure state rotation channel 
with 9 = 57r/12 and the seven qubit Steane code. In 
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FIG. 11: Dual bound comparison for the amplitude damping 
channel and the five qubit code. The iterated dual initial- 
ized with the Block EigQER algorithm with M = 2 is es- 
sentially indistinguishable from the optimal recovery perfor- 
mance, thus producing a very tight bound. Included for com- 
parison are the EigQER performance, the SVD dual bound, 
and both a channel-adapted recovery and associated bound 
derived by Barnum and Knill in [Tol | . 
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FIG. 13: Dual bound comparison for the pure state rotation 
channel with 6 = 5-7r/12 and the seven qubit Steane code. 
Note that the iterated bounds are generally, though not uni- 
versally, better than the SVD dual bound. We also see that 
the shorter block lengths for the BlockEigQER algorithm gen- 
erally produce a tighter bound, despite slightly poorer recov- 
ery performance. 
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FIG. 12: Dual bound comparison for the amplitude damping 
channel and the nine qubit Steane code. The iterated dual 
bound initialized with the BlockEigQER recovery with M = 2 
produces a bound that is tight to the EigQER recovery opera- 
tion. This demonstrates that the EigQER recovery operation 
is essentially optimal in this case. Notice that the iterated 
bound initialized with the EigQER recovery operation does 
not generate a tight bound. 



Fig. [T21 we can distinguish between several initialization 
methods for the dual iterative bound. We see that none 
of the recovery operations approach the bound perfor- 
mance for large <p, though the performance is relatively 
tight as the noise level drops ((f) — > 0). Notice that in 



general the iterative bounds are better than the SVD 
dual bound, however there are points, especially for the 
BlockEigQER algorithm with M = 8, where the iter- 
ated bound is poor. It is interesting to note that the 
longer block lengths (larger M) usually generate better 
recovery performance (which can be seen with slight im- 
provement even in this case) yet often produce poorer 
bounds. Anecdotal experience suggests that the best it- 
erative starting point is the BlockEigQER recovery op- 
eration with M = 2. 

Finally, we should point out the gap for large <f> be- 
tween the recovery performance and the dual bounds. 
Absent a better recovery operation or a smaller perfor- 
mance bound, we have no way to know whether the 
bound or the recovery is further removed from the op- 
timal. However, this region is below the baseline perfor- 
mance for a single unencoded qubit, and thus is not of 
serious concern. 



VI. CONCLUSION AND FUTURE WORK 

Adapting a quantum recovery operation to a physical 
channel can significantly improve the effectiveness of a 
quantum channel. In this way, quantum error correc- 
tion can be made more efficient, which should aid in 
scaling physical implementations to a larger number of 
qubits. While the optimal recovery (in terms of aver- 
age entanglement fidelity) may be calculated via convex 
optimization of a semidefinite program, we have derived 
a class of near-optimal algorithms that are less compu- 
tationally intensive. Furthermore, these algorithms yield 
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recovery operations of a particular form: they implement 
a projective error syndrome measurement followed by a 
syndrome recovery operation. This structure may prove 
easier to implement physically and provides intuition into 
the mechanism for channel-adaptation. 

Despite the reduction in computation from the SDP, 
even these algorithms grow exponentially in the length of 
(i.e. the number of qubits in) the code. For this reason, 
the next step toward practical application of channel- 
adapted quantum error correction must include analyti- 
cal tools to supplement these numerical techniques. Fur- 
thermore, to apply channel-adapted methods to fault- 
tolerant quantum computing, we must show how errors 
propagate from block to block. These two open ques- 
tions arc likely closely linked. Despite these obstacles, 
the added efficiency of channel-adapted recovery suggests 
significant value for practical efforts in quantum error 
correction. 
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VII. APPENDIX 

The discussion of the Gersgorin and SVD dual bounds 
make use of a structured dual feasible point. This is mo- 
tivated by the optimal dual feasible point for a stabilizer 
code and a Pauli error channel. Construction of this op- 
timal dual feasible point proves the intuitive structure of 
the optimal recovery operation for Pauli error channels. 
This theorem was proven in [l|| and will be restated here 
for reference. 

We can construct the optimal recovery operation for a 
stabilizer code when the channel £' is characterized by 
Pauli group errors and the input ensemble is the com- 
pletely mixed state. That is, E is given by p = I/ds 
with p = 1 and the channel can be represented by Kraus 
operators {Ei} where each Ei is a scaled element of the 
Pauli group. (Notice that this does not require every 
set of Kraus operators that characterize £' to be scaled 
elements of the Pauli group, since unitary combinations 
of Pauli group elements do not necessarily belong to the 
Pauli group.) 

To state the optimal recovery, we carefully define the 
syndrome measurement subspaces and the Pauli group 
operators that connect the subspaces. We must do this in 
a way to consistently describe the normalizer operations 
of the code. Consider an [n, k] stabilizer code with gen- 
erators (gx, . . . , g n -k) and logical Z operators Z\,...Zk 
such that {<7i, . . . , g n -k, Z\, . . . Zk} form an independent 
and commuting set. Define logical X operators such that 
[Xi,g£ = [Xi,Xj] = V [Xi, Z s \ = for % ± j and 
{X u Zi} = 0. 

The syndrome subspaces correspond to the intersection 
of the ±1 eigenspaces of each generator. Accordingly, we 
label each space S q where q = 0, 1, . . . , 2 n ~ k — 1, where 
So corresponds to the code subspace. Let P q be the pro- 
jection operator onto S q . Let {|iiZ2 ■ ■ ■ form a basis 



17 



for S q such that 

Z X Z 2 ■ ■ ■ Z k \ H i 2 ■ ■ ■ i k ) q = (-1)* 1 (-1) 42 • • ■ (— l) ifc \ii%2 ■■■ik) 

(53) 

where ij £ {0, 1}. In this way, we have a standardized ba- 
sis for each syndrome subspace which can also be written 
as {|m) g }, m = 0, . . . , 2 k — 1. 

Let us recall the effect of a unitary operator on a sta- 
bilizer state. If \ip) is stabilized by (g\, . . . ,(?„_&), then 
U \tp) is stabilized by (UgiW, . . . , Ug n ^ k U^). What hap- 
pens if U € G„, the Pauli group on n qubits? In that 
case, since U either commutes or anti-commutes with 
each stabilizer, U is stabilized by (±gi, ■ ■ ■ ,±g n —k) 
where the sign of each generator gi is determined by 
whether it commutes or anti-commutes with U. Thus, 
a Pauli group operator acting on a state in the code sub- 
space Sq will transform the state into one of the subspaces 
S q . 

We have established that the Pauli group errors always 
rotate the code space onto one of the stabilizer subspaces, 
but this is not yet sufficient to determine the proper re- 
covery. Given that the system has be transformed to 
subspace S q , we must still characterize the error by what 
happened within the subspace. That is to say, the er- 
ror consists of a rotation to a syndrome subspace and a 
normalizer operation within that subspace. 

Let us characterize these operations using the bases 
{\m) q }. Define W qq > = J2 m \ m )q'q( m \ as the opera- 
tor which transforms S q <— > S q > while maintaining the 
ordering of the basis. Define the encoding isometry 
Uc = J2m l n )os(^| where \n)s G Hs, the source space. 
Further define U cq = W q Uc, the isometry that encodes 
the q th syndrome subspace. We will define the 4 fc code 
normalizer operators as 

A p ee XI 1 X] 2 • • • X l k k Zf Zf ■■■Zl" (54) 

where p is given in binary as i\i 2 ■ ■ ■ ik.j1.j2 ■ ■ • 3k- No- 
tice that if a similarly defined is an element 
of the Pauli group Q k e C{Hs) with generators 
(Xf , . . . , Xjjj, Zf, ... , ), we can conclude A p Uc = 

The preceding definitions were chosen to illustrate the 
following facts. First, we can see by the definitions that 
[W gg ',Ap] = 0. That is, W qq > characterizes a standard 
rotation from one syndrome subspace to another, and 
A p characterizes a normalizer operation within the sub- 
space. These have been defined so that they can occur in 
either order. Second, let £' be a quantum channel repre- 
sented by operator elements that are scaled members of 
the Pauli group Q n . Then the composite channel £ which 
includes the encoding isometry Uc can be represented by 
operator elements of the form 

{E pq = a pq A p W q U c = cipqApUcq}, (55) 

where the CPTP constraint requires J2 pq \ a pq\ 2 = 1- 

We can understand the amplitudes a pq by noting that 
with probability |a p(3 | 2 , the channel £ transforms the orig- 
inal state to S q and applies the normalizer operation A p . 



To channel-adaptively recover, we project onto the stabi- 
lizer subspaces {S q } and determine the most likely nor- 
?malizer operation for each syndrome subspace S q . Let 
p q — argmaxp |a piJ | 2 , and let a q ee a p q . With these defi- 
nitions in place, we can state the following theorem: 

Theorem 3. Let £ be a channel in the form of H55\ ). 
i.e. a stabilizer encoding and a channel with Pauli group 
error operators. For a source in the completely mixed 
state p = I/ds the optimal channel- adapted recovery op- 
eration is given by 1Z ~ {U Cq A Pq }, which is the stabilizer 
syndrome measurement followed by maximum likelihood 
normalizer syndrome correction. 

Proof. We prove Theorem [3] by constructing a dual fea- 
sible point Y such that the dual function value trY is 
equal to the entanglement fidelity F e (p, 1Z o £). 

We begin by calculating F e (p, 1Z o £). For later conve- 
nience, we will do this in terms of the Choi matrix Ce,s 
from We write the entanglement fidelity in terms of 
the recovery operator elements \U Cq A Pq )}: 

F e (p,TZo£) = tYX n C E . £ (56) 

= J2(( U k' A P q '\ C E,sPl q 'ApJ.(57) 

q' 

To evaluate (|5"7| , we note that 

{{pU Cq Ap\U Cq ,A Pq ,)) - trA p U Cq pU Cql A Pq , (58) 

= tTA p W q UcpU c wl,A Pq , (59) 

= txA p Wl,W q U c pU ] c A Pi ,{m) 

= 8 qq > tr ApUcpU^Ap^ (61) 

= 6 qq ,trA%pA$ q ,. (62) 

We have used the commutation relation [W qq i , A p ] = 
to arrive at (|6H)) and the facts that W q ,W q = S qq 'Po 
and PqUc = Uc to conclude (|6"T|) . Since p — I/ds 
and trApAp, = S PPq ,ds, we see that tr A p pAp ( = S PPq , . 
Thus, 

((pUl q Ap\U Cq ,A Pq , P q ,)) = S PPq , 8 qq/ . (63) 
Using (|63|) . it is straightforward to evaluate {57|: 

F e (p : Ko£) = Y,\ a pMP U k A p\ U k' A pJ\ 2 ^ 

pqq> 

pqq' 

= Ei^i 2 - ( 66 ) 

q 

We now propose the dual point Y — J2 q \ a q\ 2 Pq/ds- 
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Since 

trF = ^|5 9 | 2 tr^/d s (67) 

= Em 2 ( 68 ) 

= F e {p,KoE), (69) 

we complete the proof by demonstrating that 

/ ® Y - C E , £ > 0, (70) 

i.e. Y is a dual feasible point. We show this by demon- 
strating that I®Y and Ce s have the same eigenvectors, 
and that the associated eigenvalue is always greater for 
I®Y. 

By the same argument used for (|63|) . we note that 

((pUl q A p \pUl q ,A p ,)) = W/4- (71) 



This means that \pU^, q A p )) is an eigenvector of Ce,s with 
eigenvalue \a pq \ 2 /ds. We normalize the eigenvector to 
unit length and apply it to I ® Y: 

I®Y\pUl q A p /d s )) = J2K\ 2 P^/ds\pU^ Cq A p /dm 

= ^EmWa^'/w) 

q' 

= ±-Va q \ 2 \pUl q A p /ds)). (74) 

Thus we see that \pUq A p )) is an eigenvector of I®Y with 
eigenvalue \d q \ 2 /d s > \a pq \ 2 / 'ds V p. Thus Ii&Y— C E ,s > 
and Y is a dual feasible point. □ 



